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Abstract: We consider a diffusion (£t)t>o whose drift involves some T-periodic signal. T is fixed 

and known, whereas the signal depends on a d-dimensional parameter •& € 0. Assuming positive 

Harris recurrence of the grid chain (£fcT)fcew an d exploiting the periodic structure in the semigroup, 
O ■ 

we work with path segments and limit theorems for certain functionals (more general than additive 
functionals) of the process to prove local asymptotic normality (LAN). Then we consider several 
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estimators for the unknown parameter. 
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t-j- I We consider a problem of parameter estimation in a time-inhomogenous diffusion (£t)t>o whose semi- 
group has a T-periodic structure, and whose drift involves a signal -deterministic, continuous, of 
known periodicity T- which is parametrized by some d-dimensional parameter i5 G 6. Our main 
assumption on the process is positive Harris recurrence of the grid chain (£fcT)fcew which implies (as 
in [HK 10]) positive Harris recurrence of the path-segment chain 

X = ( X fc) fc , X fc := {£(k-i)T+s) 0<S<T 
CZ i 

taking values in C([0,T]): this allows for limit theorems using the path-segment chain. Our main 

statistical assumption is that the derivative of the signal with respect to the parameter is again a 

T-periodic function, and that the parametrization is sufficiently smooth, in a suitable L 2 -sense related 

to the periodic structure of the semigroup. 

We prove local asymptotic normality (LAN) for the sequence of statistical models corresponding to 
observation of a trajectory of £ - with unknown - over a long time interval [0, nT], and state 
a local asymptotic minimax theorem. See LeCam ([LC 68], [LC 90]), Hajek [H 71], Ibragimov and 



Khasminskii [IH 81] for background on LAN and convergence of estimators, see also the survey paper on 
local asymptotics by Davies [D 85], and see Liptser and Shiryaev [LS 81], Jacod and Shiryaev [JS 87], 
Kutoyants [K 04] on background on statistical models and likelihood ratio processes in diffusions. 
'Signal in white noise' problems seem to appear first in Ibragimov and Khasminskii [IH 81]. The 
limit theorems which we use for time-inhomogeneous diffusions (£t)t>o with periodic structure in the 
semigroup are from [HK 10, Section 2]. 

We then consider a sequence of minimum distance estimators (MDE) which via LeCam's one step 
correction can be improved to get an asymptotically efficient estimator sequence, thus providing an 
easy-to-calculate and explicit alternative to maximum likelihood estimators and Bayes estimators 
which under some conditions are asymptotically efficient for the unknown parameter. 



1. Local asymptotic normality 

Our setting is as follows. The observed diffusion is inhomogeneous in time 

(1) di t = t) + 6(6)] dt + <r(tt) dW t , t > , Co = x 

and depends on an unknown parameter d ranging over an open set G C M d . The drift in (pQ) involves 
a deterministic signal S(-, •), which is a continuous function 0x[O,oo) — > M satisfying 

(2) for every ■& G O fixed: S($, •) : [0, oo) — > M is T-periodic 

for some period T (fixed and known, and not depending on We write throughout the paper 

iT(t) '■= t modulo T . 

The functions b(-) and cr(-) are assumed Lipschitz, and a(-) strictly positive. Hence for all values 
of i? £ 0, we have Lipschitz and linear growth conditions for the time-dependent coefficients of 
equation (TjQ), and thus existence and pathwise uniqueness for its solution ([S 65], [KS 91]). We discuss 
'ergodicity properties' for the time inhomogeneous diffusion f. Write (P$)o< s<t<oo for the semigroup 
of the process (H|). As a consequence of ([2]), this semigroup is T-periodic in the sense 

(3) Psf(x, dy) = Ps% Ttt+kT (x, dy) for all k G IN and all < s < t < oo . 
We assume 

for every # G 0, the grid chain (6cT)fceJV is positive recurrent 

(HI) ' 

in the sense of Harris with invariant probability /x' ' on (M, B(IR)) 
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and decompose the path of £ into a Markov chain of T-segments 

X = (X k ) k defined by X k := (£( k -i)T+s)o<s<T , k > 1 

which takes values in the path space (Ct,Ct) of continuous functions [0, T] — > M, using as initial value 
Xo some a G Ct such that a(T) = xq. As a consequence of ([3]), the chain X is time homogeneous. 
As a consequence of (HI), X will be positive recurrent in the sense of Harris under •& (see [HK 09, 
theorem 2.1]), with invariant probability on (Ct,Ct) whose finite-dimensional distributions are 
given in terms of /i^ and , < s < t < T. In order to obtain limit theorems for logdikelihoods 
related to observation of the time-inhomogeneous process £ over a long time interval, for asymptotics 
of local models at i?, or for rescaled estimation errors of interesting estimators, we shall always exploit 
the ergodicity of the time homogeneous Cy-valued chain X. 



We turn to properties of the parametrization and list the properties of the parametrization which we 
shall need. Our identifiability condition is 



(H2) 



for all i? G 0, e > 0: inf sup |5(£ s) - S(0, s)| > . 

|C-0|>£ 0<s<T 



A sufficiently general differentiability condition is as follows. Restrict the continuous periodic functions 
{£(£, •) : £ G 0} to [0,T], assume that all measures 



u Wp w (—) 



li- 



ds on ([0,T],S([0,r])) , i5e0 



(4) A W (ds) := 

are finite, and consider spaces 

ElW = L 2 ([0,T],fi([0,T]),AW) , ^9 
Assume that for every $ 6 0, there is a function 



, G HW for i = 



v too / 



such that the following holds: 
(#3) 



/O e (i?, •) : = sup 
IC-*l<e 



s(c,-)-s(*,-)-(C-fl) T s(fl,-) 



G #) for some e = e(0) > 



p £ (i?, •) — > in #) as e | . 

This is slightly more than Frechet differentiability of £ — > S(C, •) at £ = $ in the Hilbert space 
The assumption that the derivative with respect to the parameter t — > S(i),t) is again a T-periodic 



function -the key to the results presented here- is a strong assumption; e.g., it rules out estimation 
of an unknown periodicity ('frequency modulation') as considered in [IH 81, p. 209] or [CLM 06]. We 
call the dxd matrix 

(5) J<*) := / 5(#,s) S($,s) T \ W (ds) 

Jo 

Fisher information at $, and add the assumption 

(HA) jW is invertible for aU •& € 6 . 

We introduce two additional assumptions. For S := {a2~ k : k G Mq , a £ Z5 d } the set of dyadic 
numbers in ]R d , we shall need 



(H5) 



sup 

C6Sne,|c-i?|<e 



S((, •) - S(#, •) e for some e = e(0) > 

inHW as e|0 



for all t? 6 together with 



(2T6) 



sup 

fesne, |C-i?|<e 



/(C) _ /W 



as e 4 



Let denote the law of the solution of dTJ) -h (J2J) under i? € ©, a law on the canonical path space 
(C,C) of continuous functions [0, oo) — )■ equipped with the metric of locally uniform convergence. 
Write Qf for the restriction of to the a-field Gt of events before time t + , and® = (Gt)t>o- The 
likelihood process of Q$ with respect to Q" 9 relative to<£ (cf. [LS 81], [JS 87], [K 04]) is given by 

(6) L c, =e Jf s«.t)-s m dmf \ = / I s(t,t)-s(e.t) w \ 

V7o / \Jo (?{Vt) J 

where rj = (r]t)t>o denotes the canonical process on (C,C,(S), m^' its ((E, Q^)-martingale part, and 
where := f Q ^ \ dmi is a (Q^ffl^-Brownian motion. The following is a '2nd Le Cam lemma' 
for time inhomogeneous diffusions of type (flj)+ (j2"|) : 

1.1 Theorem : Under (H1)+(H3), in the sequence of experiments 

(C, GnT, [Qi T ■ C 6 ©}) , n^oo, 

we have LAN at every point i? £ 9, with local scale n _1//2 and Fisher information given by ([5]). 
More precisely, for every £ 6 and arbitrary bounded sequences (h n ) n in we have a quadratic 
decomposition of log-likelihood ratios 

(7) logL^ 172 ^ = hl^) - \hll^h n + o Q »{l) asn^oo 
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with score 



r (Vs 



such that 

(8) £(AW I Q*) — ► AT(0, /W) (weak convergence in n — >• oo) 



The proof of theorem 1.1 will be given in section 2 below (see 2.1). For background on LAN see [L 
69], [H 70], [IH 81], [D 85], [LY 90], [K 04], or also [H 08] for some more details. A well-known and 
powerful consequence of LAN is the local asymptotic minimax theorem (corollary 1.1' below, from 
[L 90, p. 83]; a result using neighbourhoods in a different way is [IH 81], with additional uniformity 
assumptions which we do not make here). Fix any reference point $ € 0, write 

(9) ZW .= (/W)-lA£» , „>1 
for the central sequence in the sequence of local models 

:= {P„,tf, h := Q^+ n " 1/2/l : h € M d such that $+n~ l ' 2 h G @| , n > 1 
at i?, and let £(•) denote any loss function which is continuous, subconvex and bounded. Then 

1.1' Corollary : (LeCam [L 90, p. 83]) Under (HI) and (H3)+(H4), the following holds. For any 
sequence of £/ n T-measurable estimators i? n , for any $ G ©, 

sup liminf sup £W n -i/ 2ft) (l U 1 ' 2 ^ - (0 + n'^h)))) > f°° £(z)Af(0, (jW)" 1 )^) . 

c<oo ri^oo | fe |< c V \ // J_ QO 

Sequences of £/ n T-measurable estimators with the property 

(10) ra 1/2 «-tf) = + 0q *(1) asn^oo 
attain the local asymptotic minimax bound at i9, and have 

liminf sup E {i)+n . 1/2h) (l (n 1 ^ - (0 + rT^h)))) = J °°J(z) M(0, (I^)" 1 )^) 
for every < c < oo. 



Estimator sequences attaining the local asymptotic minimax bound at are termed efficient at 
The criterion ()10p characterizes efficiency at $ in terms of stochastic equivalence under •& of rescaled 
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estimation errors at i? and central sequence at Using Le Cam's 'one step correction', efficient 
estimator sequences can be constructed explicitely provided we dispose of at least one preliminary 
estimator sequence which is n 1//2 -consistent. The construction which we give here follows Davies [D 85]. 

Theorem 1.2 : Assume (HI), (H3)+(HA), (H5)+(H6), and let t? n denote some sequence of 
(? n T-measurable estimators with the property 

for every i5e9: C (n 1/2 (d n - tf) | Q w ) , n > 1 , is tight in M d . 

d 

For a = (aij.-.jOd) G % d and k £ Wo, consider cubes C(fc,a) := X [a,-2" fc , (a,-+l)2- fc [ in iR d . 

i=i 

Fixing some point 4 £ 9, we discretize the estimator # n to 

K := #0 + ^ (a2- ft -<? )lc(«,a)(£») 

qG* s.t. c(n,«)ce 

which takes a countable number of values in 0. Then 

C := ^„ + n- 1 / 2 (#"))- 1 A(f") 
is a sequence of £/ n r-measurable estimators for & G which has the property (fTUj) at all points # € O. 

Thus by corollary 1.1', the sequence (#£) n constructed in theorem 1.2 is efficient at all points i9 G 0. 
Since sequences meeting (|1U|) are regular at i9 in the sense of Hajek ([H 70]), they are also regular and 
efficient in the sense of Hajek's convolution theorem. 



1.3 Example: We look to equations (HD of Ornstein-Uhlenbeck type 

b(x) = —f3x for some (3 > , a(-) = a > constant . 

1) It follows from [HK 10, example 2.3] that (HI) holds for arbitrary continuous and T-periodic 
functions t — > S("&,t), $ G 0. The diffusion coefficient being constant, the measure AW in © is 
Lebesgue measure on [0,T] multiplied by the constant -j. It does not depend on # G G. 

2) Let C M d be open, let (K m ) m denote an increasing sequence of compacts with K m C int(K m+ \), 
m > 1, and © = U -fiT m . For ever y t G [0, T] fixed, let the mapping ( — > S((,t) be C 1 on ©. Then 



(11) for every m : sup sup sup u S(£,t) 

te[o,T] (€K m \ u \=l 



< oo 



is a sufficient condition implying (H3) + (H5) + (H6). Under (jlip . at every point $ G 0, 

r-T 



Cex m C-v Jo 



provided m is large enough for i? G int(K m ), and we may consider the following partially converse 
condition: for all i9, assume that there is some e = e(?9) > such that 
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(12) Ff^ I ("«.*)-*<*.«»•* > o. 

Then under §TQ), condition flE) implies both (J?2)+(iT4) together. Hence, if C -> S^C*) is C 1 on 6 
for fixed i, (HU+O are sufficient conditions for (H2)-(H6). 

3) Fix some continuous T-periodic function /(•), non-constant. Put := (0, T) x (0, oo), write 
•& = (i?i,??2)) an d define (with notation ^t(^) for i modulo T as above) 

S(0,t) := 2 /(»r(t)-0i) , t E M . 
Then (|ll|) + (|12p and thus (H2)-(H6) hold. By T-periodicity of /(•), the Fisher information in (0) is 

a2 V -Mcfl/'/K*)*' / T / 2 ( s )d s 

which does not depend on $i G (0, T). 

4) Assume that T is large compared to 2, put = (0,T), and define 

:= with /(x) = l {M0 |< 1} (l-|i r (t)l) , ^ . 

Then assumptions (i?2) — (#6) are satisfied with 

S(#,t) = (l (w) - (»r(t)) , feiR 

and Fisher information 

= 4 , * e 

not depending on i? G 0. Similiarly, if we replace the definition of / above by 

f(x) = l {liT{mi} (l-\i T (t)\) a , t£M 

with some a > 1, then assumptions (i?2) — (#6) are satisfied: we have 

S(#,t) = g(t-#) with g(t) := a(l-|i T (t)|r" 1 (l(o,i) - l(-i,o)) M*)) 

and the Fisher information does not depend on G 0. This is analogous to some examples in 
[L 90, p. 32]: §(■&,-) as element of #) is defined for a > ^, (#3) holds for a > | by dominated 
convergence, but (#5) is violated in case 1 > a > |. □ 
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2. Proofs 



In this section, we assume that the strong Markov process £ lives on some (CI, A, IF, (Px)xem)', 'almost 
surely' means almost surely with respect to every P x , x G ]R; is the filtration generated by £, and 
an iF^-increasing process is an iF^-adapted cadlag process A = (A t )t>o with nondecreasing paths and 
Aq = 0, almost surely. 

(Ct,Ct) ~ the space of all continuous functions a : [0,T] — > M with Borel <r-field Ct - is a Polish 
space, and Ct is also generated by the coordinate projections n t , < t < T. The one-step-transition 
kernel Q(-, ■) of the chain X of T-segments in the path of £ is given by 

Q(a,F) := P ( fe)o< s <r 6 f | 6 = a(T) ) , a G C T , F € C T . 

For Harris processes in discrete time, we refer to Revuz [R 75] or Meyn and Tweedie [MT 93]. For 
Harris processes in continuous time see Azema, Duflo and Revuz [ADR 69] or Revuz and Yor [RY 91, 
Ch. 10.3]. The following (with sub- or superscripts # suppressed from notation) is a precise statement 
of the 'ergodicity properties' mentioned in section 1. 



Theorem A: ( [HK 10, theorem 2.1]) Assume that the grid chain (£fcT)fcew is positive recurrent in 
the sense of Harris, and write fi for its invariant probability on (M, B(M)). 

a) Then the chain X = (Kk)kew of T-segments in the path of £ is positive recurrent in the sense of 
Harris. Its invariant probability is the unique law m on (Ct,Ct) such that 



(13) 



for arbitrary = t < h < ... <t t < t i+1 = T and Aj G B(M) , 
m ({TTti G Ai , < i < ^+1}) is given by 
^ / . . . / fi(dx ) l Ao (x ) nLo P U,U+i ( x i, dx i+l) l A t+1 (x i+ i) ■ 



b) For every iF^-increasing process A = (A t )t>o with the property 



(14) 



there is some function F : Ct — > JR, nonnegative, C^-measurable, 
satisfying m(F) := f c F(a>) m(da) < oo , such that 

A kT = = E F ((£ (fe _i)T +s )o< s <r) , k>l 

3=1 3=1 



we have the strong law of large numbers 



lim — At = — m(F) almost surely . 

t— >QO t T 



This strong law of large numbers will be the key tool to prove theorem 1.1. We use the notations 
which have been introduced in section 1. 



2.1 Proof of theorem 1.1 : We assume (HI) and (H3). 
1) Fix i?. For h G M d , consider 

lo L ^+ n - 1/2h )/» + »~ 1/2 M) - dB y) _ 1 f nT ( S($ + n- 1 / 2 h,s)-S($,s) \ 2 

° S nT ' Jo <Vs) s 2 Jo { <r(Vs) J S 

under . Adding ±n~ l l 2 hJ S^, •) in the numerators of the integrands, we separate leading terms 

(15) h T A^($) - hi T r+{$)h 
defined by 

(16) w ^-LffM^. , t >_ 



from remainder terms 



(18) N$(h) - hffl£(h) - Vp(h) 
defined by 

(19) := T W+^Ml-W.)-.- 1 ^,.) dB{f) t > 

Jo trWi) 

(20) Um = jf" ^ + n-y%s)-S^s)-n'^S(^ ' ^ 

(21) v t [h) .- y o ^ d S . 

We shall show convergence of in (|1T[) to the Fisher information 

Jo 

Q^-almost surely as n — > oo, weak convergence of C (Aj.(i?) | Q^) in (fT6j) to JV(0, JW), as well as 

(22) iVr(M , , VP(h n ) — ► in (^-probability as n -> oo 



for arbitrary bounded sequences (h n ) n in M d . 

2) Consider first the processes {I't('&))t>o defined in (fT7|) . Write Fjj> : Ct — > JR for the function 

' ' Sj{#,s)Sj>{ti,s) 



a ^ Fj ' Aa):= l { **(<*(*)) ) dS 
with arbitrary 1 < j,f < d. By definition of \^\ds) in © and since S0, ■) £ by (iT3), 



(23) = / %'(a)m (tfJ (da) = / S j (-&,a)S j t(-&,8)X^{da) 



T 



' Ct 

is well defined and finite, and equals [/W] .,. Combining (|17p with theorem A , we obtain 



(24) [im] jt i = ^E^'(^) mW(^) = [jMj . 

j> ^ t — < L J 7,7' 

almost surely as n -> oo. This is Q^-almost sure convergence of to the Fisher information 



3) We consider the processes in (|16|) : (A™(0))t>o are martingales with angle brackett {I^{9))t>o 



relative to and (G n t)t>o- For any (3 G iR d fixed, the function F(a) = Y^j j' PjFj j'( a ) Pj' is 



nonnegative and m^-integrable by step 1) above, we have 

I tn j r tn 1 

i^F(X fc ) < /3 T W)/3 < F(X,) 
n n z — ' 

k=l k=l 
and thus by theorem A, for every u > fixed, 

(25) lim f3 T lZ T ($)f3 = u- r (p T S0&,s)) 2 \<&(ds) = u ■ f3 T 1^ (3 

rn-oo Jq \ ) 

almost surely as n — > oo. From (|25p . the martingale theorem [JS 87, VIII. 3. 22] gives weak convergence 
in the Skorohod path space D as n — > oo 



where -B($) is d-dimensional Brownian motion with covariance matrix jW. But /3 G iR d was arbitrary. 
So we have in particular weak convergence in M d as n — > oo 

c(a t {#) I Q 1 *) — ► AA(o,/W) . 

All assertions concerning convergence of the leading terms f)15|) are now proved. 

4) We turn to the remainder terms in (fTBj) and fix any bounded sequence {hn)n in We start with 
processes (|20p and write, for iV arbitrarily large and fixed, 

nN 

k=l 
10 



where G(-,n,h) : Ct — > [0, oo) is the function 

f T (S{$ + n-^h, s) - S(0, s) - n-WhJSiti, s) \ \ 
a — > G(Q,n,/i) := / ; — — as . 

Jo \ ^("(s)) J 

Define with p e (ti, •) G introduced in (H3) 

G £ {a) := f T ( Pe ^; S l \ ds , a G C T , e > . 
For (h n ) n bounded by c, (H3) allows to find for every e > some no such that 
(26) G(a, n, h n ) < c 2 rC x G £ (a) for all a G Ct and all n > no . 

For any e > fixed, the strong law of large numbers in theorem A combined with (|26p shows that 

nN 

limsup U% T (hn) < lim c 2 -VG £ (X fc ) = c 2 Nm^(G £ ) 

n->oo n f— f 

fe=l 

almost surely. But the second part of assumption (H3) shows that 

T 



m W(G e ) = f G £ (a)m W (da) = [ p 2 £ ($,s)\ W (d 
Jc T Jo 



as e decreases to 0. Both last assertions combined give 

(27) lim Uj!j T (h n ) = almost surely, for arbitrary fixed N fixed . 

71— s-oo 

The processes {U™{h n ))t>o being increasing, this implies the following property of the paths: 

(28) U,{h n ) vanish uniformly on compact time intervals, Q^-almost surely as n — > oo . 

5) Next, {Nt~(h n ))t>o defined in (fT9|) are martingales with angle brackett {Up(h n ))t>o relative to Q 
and {Gnt)t>o- Hence for every n, the process 



sup \N^{h n )Y 

0<s<t / t > 

is dominated in the sense of [JS 87, p. 35] by (U™(h n )) t>0 . As a consequence of step 2) this implies 

(29) N™(h n ) vanish uniformly on compact time intervals, in Q^-probability as n — > oo . 

6) Finally, the reasoning of step 4) combined with Schwarz inequality and step 2) shows that 

(30) V™{h n ) vanish uniformly on compact time intervals, Q^-almost surely as n — > oo . 

Hence all remainder terms in ()22p vanish, and the proof of theorem 2.1 is complete. □ 
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2.2 Proof of theorem 1.2 : We assume (HI), (H3)+(HA), (H5)+(H6). The proof follows the 
approach of Davies [D 85, p. 849]. 



1) In a first step, we shall show that for every i? G and every c < oo we have approximations 

(31) sup A^-(aW-I(*V /2 (C-0)]) = °Q*(1) asn^oo 
Ce5ne, |c-i?|<cn-V 2 v 7 

with S := {a2~ fc : E IVq , a G Z d } the set of dyadic numbers. 

From (jSJ) and theorem 1.1 we have for points ( 6 5n6 such that |£ — i?| < en -1 / 2 

ACO = -1- r^dm® 

(32) = ^ r ^ ^ - Tn r s m (5(c ' s) - s)) ds 

according to Girsanov theorem. The Q^-martingale term on the right hand side of (|32p is 

as n — > oo: this follows if we apply (H5) and theorem A to angle bracketts of 

dmi , n — > oo . 



i r T s(c,a)-s(0,s) 



n Jo ^(Vs 



Rewrite the bounded variation term on the right hand side of (j32|) as 



n Jo cr 2 (7] s ) 

+ ^ 0(c,)-w.r lv , (c _, )]d , 

+ U — ^) l^-^l 

The first summand equals 

jW[VH(C-<?)] + 0<? *(l) 

by definition of the Fisher information and theorem A. The second summand, by Cauchy-Schwarz 
combined with (H5), ([5|), theorem A and |£ — $| < ere -1 / 2 , vanishes in (^-probability as n — > oo. The 
third summand vanishes in (^-probability as n — > oo by Cauchy-Schwarz combined with (H3), 
theorem A and |£ — i?| < en" 1 / 2 . The forth summand behaves similiarly: we make use of (H5), (H3), 
theorem A and |£ — i?| < cji -1 ' 2 . Hence we have proved the decomposition (|3ip . 

12 



2) Exactly as in [D 85], decomposition (|3ip combined with assumption (H6) concludes the proof of 
theorem 1.2. Thanks to (H6) + (HA) and -^/n-consistency of the discretized estimator sequence (i? n )ru 
we have an information with estimated parameter such that 

/(*») = iW + 0g# (i) and )"! = (jW)-l + 0g *(l) . 

Exploiting ([3T]) and -^/n-consistency of ($ n )n, we have a score with estimated parameter satisfying 

Af) = AW - jW[nVa + 0Q ,(i). 

Hence by definition of 

r n = d n + n -V2(j(^))-iA^») 

we have immediately from (|3ip 



for the central sequence defined in ([9]). An application of (|10p in corollary 1.1' finishes the proof. □ 



3. A minimum distance estimator sequence 

We consider a sequence of minimum distance estimators (MDE) in the sense of Millar ([M 81], see also 
[Ku04, Chapter 2.2], [H 08, Chapter 2.B+C]). We write H for the Hilbert space L 2 ([0, T], B([Q, T]), A) 
with Lebesgue measure A and assume in this section 

(33) o" 2 (•) is bounded and bounded away from 

together with (H1)+(H3); slighly modified analogues to (H2) and (HA) will come in below. 

To a path segment = ( £(k-i)T+s )o<s<T we assoc i a t e 

Y k := £ {k _ 1)T+s - 6 fc _i) T - / &(&.) dr 

V J(k-l)T 



0<s<T 



and 

/ r-(k-l)T+s 

V fc := / a(e,)o?W r 



(fc-l)T 



0<s<T 
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for k > 1. Based on observation of a trajectory of the process (pQ) with unknown i9 up to time nT, we 
define empirical quantities 



1 

W n := - V Y k , n > 1 

n 

k=l 

and compare these -thanks to T-periodicity of the signals- to 

* c := ( [ S dvS((,v)) 

\ Jo / 0<s<T 

for all possible values of the parameter (£6. We have the representation 

/ i n r(k-l)T+s \ 

(34) * n = + - / v(£r)dW r 



0<s<T 



under Q , and prepare on some probability space a time-changed Brownian motion 



(35) W = (Bo$( S )) 0<s<T , *(«) := [ S dvL^P^a 2 

Jo L 



, < s < T 



With the usual conventions, our MDE sequence for the unknown parameter $ E will be 

(36) t? n := arginf ||$ n - tf f || H , n > 1 . 

Cee 

We strengthen the identifiability condition (H2) to 

(37) inf \\t> c -t>A H > 

Cee,|C-<?l>e 

for all # E 0, and have by means of theorem A in section 2 a weak law of large numbers 

(38) || \P n — ^ ||# — )■ in probability under as n — )• oo 
together with an integrability property 

(39) 5 up£^n(i( S )-^( S )) 2 j < Cs , Hm £7,, (n(* n (s) -tf*(*)) 2 ^ = *(a) 

for every < s < T. By finite-dimensional convergence combined with (|39p . the processes (|34p under 
Q converge weakly in the path space H to the Gaussian process W defined in (|35p . see e.g. Cremers 
and Kadelka [CK 86]. This implies ([M 81], [Ku 04], or [H 08, 2.13]) that the MDE sequence fl3SD 
converges in Q^-probability to the true value d and is tight at rate y/n , for all t? E 0. Thus we 
dispose of a preliminary estimator sequence for "& which is tight at the required rate. Already at this 
stage, we can apply theorem 1.2 and obtain an easy to calculate modified estimator sequence which 
is efficient in the sense of the local asymptotic minimax bound given in corollary 1.1'. If in addition, 
for all •& E 0, the components Sj(-d, •) of the derivative in (H3) satisfy 

(40) Djty$ = I / Sj(j&, v ) dv I , 1 < j < d , are linearly independent in H 

\Jo J 0<s<T 
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there is an explicit limit law for rescaled estimation errors ([M 81], [Ku 04], or [H 08, 2.23]) of the 
MDE sequence. Summarizing we have 



3.1 Theorem: Under (Hl)-(H6) and ([33|) + ([3T|) + ([4TI|) . we have for all $ G weak convergence 

C (Vn(& n ~ 0) I <3*) — ► AA(0, A^H^A" 1 ) 

in M d as n — > oo, where A$ denotes the dxd-matrix with entries Jq Di^f^(s)Dj^f^(s)ds, and S# 
the cZxfi-matrix with entries Dj*^(si)$^(si A S2)Dj^f^(s2)dsids2 for <j?,y defined in ([3"5]) . 

One-step modification according to theorem 1.2 transforms the MDE sequence ($ n ) n into an esti- 
mator sequence ($^)n which is asymptotically efficient at all points i? G O in the sense of corollary 1.1'. 

Other estimator sequences do exist which are efficient in the sense of the local asymptotic minimax 
bound of corollary 1.1'. 



3.2 Remark : In addition to (f3"3"j) . let be bounded and assume that 

(41) C|C'-C| 2 < [ (S((',t)-S((,t)) 2 dt < C\C'-C\ 2 forallCC'e© 

J o 

for suitable constants < C_, C < oo (compare (|4ip to (|lip + (|12p ). Then maximum likelihood (MLE) 

= argmaxL^ , n G N 



(with £o G O some fixed point, and the closure of 0) and Bayes estimator (BE) sequences 

, n G M 



#(*,2) ._ je_( L nT° d C 



Is % & d( 
will be efficient in the sense of corollary 1.1'. 

The main ideas can be found in Ibragimov and Khasminskii [IH 81], LeCam and Yang [LY 90], 
Kutoyants [K 04]. They have to be adapted to the time inhomogeneous setting with T-periodic 
semigroup which is the case here. Without going into details, we mention that under (|33p + (|4ip there 
are bounds for Hellinger type distances and affinities of form 



(42) 



(43) 



£ c sup L\ 

\0<s<t 



C7C 



< c + Ci 



Er 



1 - L 



-eve 



1/2' 



1 2 N 



< 



3 



+ i) IC'-CI 2 

+ i) J |C-ci 



2.7 



15 



(44) 



Er 



L 



C7C 



1/4' 



< 



5 

J'=2 



+ i ic'-ci 



2.7 



(45) 



C7C 



1/2 



< 



cxp 



with positive constants cu, which do not depend on (,(' 6 6 or on i > 0: based on ideas from the 
references above, a proof similiar to [HK 10, proof of lemma 5.1] goes through with 

S(C,s)-S(Cs) 



5, :-- 



cd(s) < 5 2 s < cd(s) 



C 



d(s) :=\S((>,s)-S((,s)\< 



+ i) ic'-ci 2 



IC'-CI 2 < f d(s)ds < C 
Jo 

where we exploit (|33| ) + (|41|) . with suitable constants. Combining (|43p - (|45|) with convergence of ex- 
periments according to theorem 1.1, one follows the approach of Ibragimov and Khasminskii [IH 
81] towards convergence of MLE and BE together with their moments of arbitary order. One can 
also use LeCam's 'Third lemma' to consider these estimator sequences under contiguous alternatives. 
For the sequence of local models at a reference point i? defined according to (|7|) in theorem 1.1, let 
£ = {P u : u G M d } denote the Gaussian limit experiment with central statistics u*. Then u* is the 
MLE in the limit experiment, and also the BE ([IH 81, p. 180], [K 04, p. 134]). Thus MLE and BE 
in the Gaussian limit experiment £ are equivariant estimators, the law of their error under arbitrary 
u £ M being equal to C(u*\Pq). For both sequences, we thus obtain 



(46) lim sup 

n ^°° \u\<C 



(' (» 1/2 (*&° - ($+n- 1/2 u)))) - E Po (£(u*)) 







for arbitrary loss functions which are continuous subconvex with polynomial majorant, and for 
arbitrarily large constants C. Combined with corollary 1.1', (|46|) is efficiency of both the MLE and 
the BE sequence, simultaneously for a large class of loss functions, in the sense of the local asymptotic 
minimax bound of corollary 1.1'. 
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